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ABSTRACT 



Conditional cumulants form a set of unique statistics which represent a sensible 
compromise between ./V-point correlation functions and cumulants measured from mo- 
ments of counts in cells. They share accurate edge corrected estimators with iV-point 
correlation functions, yet, they are as straightforward to measure and interpret as 
counts in cells. The conditional cumulants have three equivalent views as i) degener- 
ate TV-point correlation functions ii) or integrated monopole moments of the bispec- 
trum iii) they are closely related to neighbour counts. We compute the predictions of 
weakly non-linear perturbation theory for conditional cumulants and compare them 
with measurements in simulations, both in real and redshift space. We find excellent 
agreement between theory and simulations, especially on scales > 20h~ 1 Mpc. Due to 
their advantageous statistical properties and well understood dynamics, we propose 
conditional cumulants as tools for high precision cosmology. Potential applications 
include constraining bias and redshift distortions from galaxy redshift surveys. 
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1 INTRODUCTION 

Large scale structure statistics of higher than second order 
contain a wealth of information on cosmological parame- 
ters, gravitational amplification of initial fluctuations, and 
structure formation in general. In particular, higher order 
statistics have the potential to provide som e of the bes t 
constraints on the phenomenon of "biasing " l|Kaiserlll984l . 
Wh ile the core id eas have been known for over two decades 
(cf. iPecblcs 1980), the latest wide field ga laxy surveys, such 



as the Sloan Digital Sky Survey (e.g. . lYork et alJ 
SDSS), and the 2 degree Filed Survey llColless et al.l 



2000, 



2001, 



2dF) have motivated a concerted effort to enhance theories 
and techniques of higher order statistics t o the level of "high 
preci sion cosmology" (for summary, see iBernardeau et all 

Counts in cells (CIC) and related statistics have 
the most well understood theoretical background 
and consequently yielded some of the most su ccess- 
ful applications (e.g., |Gaztaiuig^^F^neman| 
ISzapudi. Meiksin fc Nichoil Il996t ISzapudi et alJ 12002ft . 
The estimation methods and the c orresponding errors 
have been worked out in detail (e.g. ISzapudi fc Colombil 
Il996t ISzapudi et alJll999h . But when CIC are measured to 
constrain theories at a few percent level, edge effects present 
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a major problem due to the complex g eometry and cut out 
holes of realistic surveys. As shown by iSzapudi fc Colombil 
(1996), edge effects cannot be corrected for exactly; an 
approximate es timator exists if the shape dependence of 
counts is weak JSzapudilll998bl) . 

On the other hand, full edge effect correction is possi- 
ble wh en using the class of estimators bv ISzapudi fc Szalavl 
( 1998) for the iV-point correlation functions. JV-point cor- 
relation functions are, however, inherently more compli- 
cated objects than cumulants of CIC. They depend on a 
large number of par ameters, their mea surement is compu- 
tationally intensive jMoore et aljl200lh . and consequently, 
their interpretation is difficult. Much of the theoretical ef- 
fort has been concentrated on the three-point function, 
yet weakly non-linear perturbation theory and halo models 
have only limited success when contrasted with s imulations 
jBarriga fc GaztafiagalkoollTakada fc Jainll2003lh Redshift 
distortions present an even more formidable challenge. 

The goal of the present paper is to explore a set of statis- 
tics, the conditional cumulants, which combine the simplic- 
ity and transparency of cumulants with the edge effect cor- 
rected estimator of the iV-point correlation. They can be 
viewed as degenerate iV-point correlation functions , or inte- 
grals of the monopole moment of the bispectrum iSzapudil 
12004) . They are als o closely relat ed to (factorial) moments of 
neighbour counts <Peebledll98ch . While previously known, 
neighbour counts have bee n used fairly in frequently in com- 
parison to cumulants (e.g.. lBorganilll995l) . The terminology 



2 Jun Pan and Istvdn Szapudi 



condit ional cumulants was introduced bv lBonometto et alJ 
(1995). They have used an estimator based on moments of 
neighbour counts and developed a theory under the assump- 
tions of stable clustering and scale invariance. 

In §2, we present formal definition of the conditional 
cumulants and their relevant properties. Predictions of the 
third order conditional cumulant in the weakly non-linear 
regime are d escribed in §3. In §4 , we ad apt the edge corrected 
estimator bv lSzaoudi fc Szalavl l)l998F) . and compare predic- 
tions with measurements in simulations. In §5, we summarise 
results, present the theory in redshift space together with 
simulation results, and discuss implications for estimating 
bias. 



2 CONDITIONAL CUMULANTS 

Conditional cumulants are defined as the joint connected 
moment of one unsmoothed and N — 1 smoothed density 
fluctuation fields. They are realised by integrals of the TV- 
point correlation function through TV — 1 spherical top-hat 
windows, 
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where Vi = 4-7r/3rf is the volume of the window func- 
tion W Ti . In the most general case, each top hat window 
might have a different radius. Further simplification arises if 
all the top hats are the same, i.e. we define Un{t) with 
ri = ... = rjy_i = r as the conditional cumulant (cf. 
iBonometto et all Il995f) . The Un subtly differs from the 
usual cumulant of smoothed field £ N by one less integral 
over the window function. 

The second order, U2, is equivalent to the confusingly 



Ine second order, U2, is cquivalcn 
named J3 integral fe.g.. |Peeblesl ll980'l. 



U 2 (r) = ^J 3 (r 



(2*) 



P(k)w{kr)^k 2 dk , (2) 



where w(kr) — 3(sinfcr — kr cos kr)/(kr) 3 is the Fourier 
transform of W r (s), and P(k) is the power spectrum. 

For higher orders, we can construct reduced conditional 
cumulants as 



R N (r) = 



Un(t) 
" L ( r ) 



U 



(3) 



Un and Rn hav e deep connec tion with moments of 
neighbour counts (e.g. Peebles 1983). Let us define the par- 
tition function Z[J\ = {exp(f iJp)) (cf. ISzapudi fc Szalavl 
1993), where p is the smoothed density field. Then we can 
use the special source function ij(x) = W(x)s + 5o{x)t to 
obtain the generating function G(s, t). This is related to the 
generating function of neighbour counts factorial moments 
as G(s) = dtG(s,t)\ t =o- The final result is 
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where nV = TV is the average count of galaxies, and 
= f/i = 1 by definition. This generating function can 
be used to obtain Un's and/or Rn's from neighbour counts 
factorial moments analogously as the generating functions 



in ISzapudi fc Szalavl lll993fi for obtaining Sjv's from facto- 
rial moments of CIC. For completeness, the generating func- 
tion for neighbour counts distribution is obtained by substi- 
tuting s — > s — 1, while the ordinary moment generating 
function by s — » e " — 1. W e checked that the we recover for- 
mulae of iPeebles! Jl980l) . §36 from G(e s - 1). The above 
generating function facilitates the extra ction of Un from 
neighb our counts statistics. For details see lSzapudi fc Szalavl 
(1993): the entire theory for CIC can be adapted to neigh- 
bour counts. So far our discussion has been general; in what 
follows we will focus on the first non-trivial conditional cu- 
mulant U3. 

Uz[r\,r2) is simply related to bispectrum by 



U 3 (ri,r 2 ) = J B(ki,k2 J k 3 )5£>(k 1 +k 2 + k 3 ) 
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where Sd is the Dirac delta function. To further elucidate the 
above relation, we use the multi-pole expansion of the bis- 
pec trum and the three point correlation function proposed 
by JSzapudibOol) 
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where cos9 is ki ^2/(^1^2) or ri ■rz/irirz), and Pi are Leg- 
endre polynomials. The multi-pole moments can be obtained 
as Bi = 2n J BPidcos9, Q = 2n f (Pidcos9. Substituting 
into the general equation, we find 
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in which j\ is the first order spherical Bessel function. We 
see the U3 depends only on the monopole moment of the 
bispectrum/three-point correlation function. This property 
significantly simplifies the transformation of the statistics 
under redshift distortions. 



3 Ua IN WEAKLY NON-LINEAR REGIME 

On large scales, where the fluctuations are reasonably 
small, clustering of cosmic structures can be understood 
in Eulerian weakly non -linear perturbation theory (EPT) 
(Bcrnardc au et al.ll20 02. and references therein). To predict 
the behaviour of U 3 from Gaussian initial conditions, we 
assume an expansion of the density field into first, second, 
etc. order, 5 = S w + <5 (2) + . . .. Then EPT can be used to 
calculate the leading order contribution to U3 = (S(0)8 2 ) c , 
where 8 r is the density field filtered at the scale r, and () c 
means connected momen t. We use the second-order EPT 
kernel JCoroff et alj ll986) 
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the linear power spectrum P(k), and integrals of the kernel 
multiplied with the top-hat window function dBernardeaul 
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Figure 1. Predictions of <7 2 (r) on large scales in real space (solid 
line) compared with measurements in N-body simulations (trian- 
gles with error-bars) with ACDM cosmology. 



Il994al) to finally obtain 
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where a 2 — / k 2 P(k)w 2 (kr)dk. The above equations 
constitute the main results of this paper. Note the similar- 
ity of -R3 with the skewness, which is calculated in weakly 
non-linear perturbation theory as S3 = 34/7 — dlng 2 /dln r 
iJuszkiewicz. Bouchet fc Colombll993tlBernardeaull994bl) . 



4 MEASUREMENTS 

U„ (r) can be measured similarly to Appoint correlation func- 
tions. For instance U2 can be thought of as a two point cor- 
relation function in a bin [ri ,rhi] = [0,r]. Taking the lower 
limit to be a very small number instead of 0, one can correct 
for discreteness effects due to self counting (this is equiva- 
lent to using factorial moments when neighbour counts are 
calculated directly) . Give n a set of data and rando m points, 
the class of estimators of Sza pudi fc Szalavl lll99ct) will pro- 
vide an edge (and incompleteness) corrected technique to 
measure conditional cumulants 



(D — R) n 
R" 



(11) 



Existing iV-point correlation function codes can be used for 
the estimation; for higher then third order, one also has to 
take connected moments in the usual way. 

While the above suggests a scaling similar to Appoint 



correlation functions, the relation to neighbour count facto- 
rial moments outlined in the previous section can be used 
to realise the estimator using two-point correlation function 
codes. To develop such an estimator, neighbour count facto- 
rial moments need to be collected for each possible combina- 
tions where data and random points play the role of centre 
and neighbour. 

Note that the edge correction of Eq. Hil l is expected to 
be less accurate for conditional cumulants than for Appoint 
correlation functions, however, the estimator will be more 
accurate than CIC estimators. Several alternative ways for 
correcting edge effects are known, w hich would be directly 
applicable to conditional cumulants llRiplevlll988l : IKerscherl 
ll999UPan fc Coledliooi) . In what follows, we use Eq. |TTJ 
for all results presented. Future high precision measurements 
could benefit fr om a shootout of possible estimators as in 
IKerscherl Jl999l) . 

To test our theory, we performed measurements in 
ACDM simulations by the Virgo Supercomputing Consor- 
tium. We used outputs of the Virgo simulation and the VLS 
(Very Large Simulation). Except for box sizes and number 
of particles, these two simulations have identical cosmology 
parameters: fl m = 0.3, Q v = 0.7, F — 0.21, h = 0.7 and 
erg = 0.9. In order to estimate measurement errors, we di- 
vide the VLS simulation into eight independent subsets each 
with the same size and geometry the original Virgo simula- 
tion. In total, we have used the resulting nine realisations to 
estimate errors. Note that we corrected for cosmic bias by 
always taking the average before ratio statistics were formed. 

Our measurements of the second and third order condi- 
tional cumulants are displayed in Figs. 1 and 2, respectively. 
Results from EPT (Eq.HITl are denoted with solid lines. The 
measurements in simulations are in excellent agreement with 
EPT, especially on on large scales ^ 20h _1 Mpc. 



5 SUMMARY AND DISCUSSION 

We presented the theory of conditional cumulants in the 
weakly non-linear regime. The unique set of statistics can be 
thought of as degenerate Appoint correlation functions, or 
integrated monopole. We have derived the generating func- 
tion of neighbour count factorial moments, revealing a deep 
connection to conditional cumulants. We introduced the re- 
duced quantity Rn, which is analogous to the cumulant 
Sn- We calculated leading order perturbation theory pre- 
dictions, and showed that results are similar to that of the 
Sjv's. However, while edge correction for CIC is not feasible, 
we proposed an accurate edge-corrected estimation method 
for the conditional cumulants. This was applied to a set of 
measurements in simulations, which yielded results in ex- 
cellent agreement with the theory, especially on large scales 
£ 20h~ 1 Mpc. The agreement with theory encourages fur- 
ther development of this statistic for high precision cosmo- 
logical applications, such as constraining bias. 

As three-dimensional galaxy catalogues are produced 
inherently in redshift space, understanding effects of red- 
shift distortions on these statistics is crucial before practical 
applications can follo w. In the dist ant o bserver approxima- 
tion, t he formula by [Kaiser (119871) and iLilie & Efstathiou 
( 1989) is expected to provide an excellent approximation 
for t/ 2 (r). According to §2, we only need to consider the 
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Figure 2. Same as Figure 1. for the third order conditional cumulant Us (left), and the reduced statistics R3 (right). 



monopole enhancement 

U a (s)= (l + \f+\/^U 2 {r) , (12) 

where / ~ fi^ 6 . This formula essentially predicts a uniform 
shift of the real space results. To test it, we repeated our 
measurements in redshift space, and found that the above is 
indeed an excellent approximation in redshift space (Fig. 3). 

Considering the relatively simple, monopole nature of 
the statistics, we expect that the overall effect on U3 should 
also be a simple shift, simila rly to the Lagrangian calcu- 
lations by iHivon e t alJ <ll 995l) and the more genera l Eule - 
rian results~bv Tscoccmiarra Couchman fc Friemanl lll999l) . 
Specifically, we propose that ratio of R3 in redshift space to 
that in real space can be approximated by 

5(2520 + 3360/ + 1260/ 2 + 9/ 3 - 14/ 4 ) 7 

98(15 + 10/ + 3/ 2 ) 2 ' 4 ' ( } 

This is motivated by the notion that the shift from red- 
shift distortions of equilateral triangles should be similar 
to the corresponding shift for our monopole statistic. Our 
simulations results (see Fig. 3) show that this simple idea 
is indeed a surprisingly good approximation, although the 
phenomenological theory based on the above formula ap- 
pears to have ~ 5% bias on scales Si 20h _1 Mpc where we 
expect that weakly non-linear perturbation theory is a good 
approximation. For practical ap plications, this bia s can be 
calibrated by iV-body, or 2LPT jscoccimarroll200d) simula- 
tions. 

In addition to the above simple formula, we have 
calculated the shift due to redshift distortions by 
angul ar averaging the bispectrum mo nopole term in 
IScoccimarro. Couchman fc Friemanl jl999l) . We have found 
that the results over-predict redshift distortions, however, 
they would agree with simulations at the 1-2% level if we 
halved the terms classified as FOG (finger of god). At the 
moment there is no justification for such a fudge factor, 
therefore we opt to use the above phenomenology, which 
is about 5% accurate. While redshift distortions of third 
order statistics are still not fully understood due to the non- 



perturbative nature of the redshift space mapping (R. Scoc- 
cimarro, private communication), detailed calculations tak- 
ing into account velocity dispersion effects will improve the 
accuracy of the redshift space theory U3. 

For applications to constrain bias, one has to keep in 
mind that redshift distortions and non-linear bias do not 
commute. However, at the level of the above simple theory, 
it is clear that one can understand the important effects at 
least for the third order statistic. There are several ways to 
apply conditional cumulants for bias determination, either 
with combinati on with one oth er statistic (CIC or cumulant 
correlators, cf. ISzapudill998ah . or using the configuration 
dependence of the more general Ra(ri,r2). One also has to 
be careful that in pr actical applications ratio statistics will 
contain cosmic bias (Sz apudi et alJll999f) . We propose that 
joint estimation with U2 and U3 will be more fruitful, even if 
R3 is better for plotting purposes. Details of the techniques 
to constrain bias from these statistics, as well we determi- 
nation of the bias from wide field redshift surveys is left for 
future work. 

Another way to get around redshift distortions is to 
adapt conditional cumulants for projected and angular- 
quantities. Such calculations are straightforward, and en- 
tirely analogous to those performed for S3 in the past. An- 
other possibl e generalisation of our theory would be to use 
halo models JCoorav fc Shetbll2002h to extend the range of 
applicability of the theory well below 20h _1 Mpc. These gen- 
eralisations are left for subsequent research. 
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